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Abstract 

A method is presented for determining second virial coefficients (-B2) of pro- 
tein solutions from retention time measurements in size exclusion chromatography 
(SEC). We determine B2 by analyzing the concentration dependance of the chro- 
matographic partition coefficient. We show the ability of this method to track the 
evolution of B2 from positive to negative values in lysozyme and bovine serum al- 
bumin solutions. Our SEC results agree quantitatively with data obtained by light 
scattering. 

Introduction 

It is well known in size exclusion liquid chromatography (SEC) that the solute re- 
tention time depends sensitively on the solute's size, although no universal calibra- 
tion for SEC has yet been achieved. It has also been realized that thermodynamic 
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non- ideality leads to concentration dependent retention times (Nichol et al., 1978). 
Such dependence can be utilized to quantify the second osmotic virial coefficient, 
B 2 . 

For a non-ideal solution the osmotic pressure II can be written as a power series 
expansion in the solute number density p (Hill, 1960). 

^ = < . + apV + ... (i) 

In Eq^T is the absolute temperature and ks is Boltzmann's constant. All terms 
higher than first order in density represent non-ideality. 

The second virial coefficients of protein solutions have generated a great deal 
of interest since (George and Wilson, 1994) showed a correlation between protein 
crystallisability and Bi- Their work demonstrated that many proteins crystallize 
in conditions where the second osmotic virial coefficient becomes slightly nega- 
tive, indicating net attractive interactions between protein molecules. The most 
prevalent experimental procedure for measuring B2 is light scattering. Addition- 
ally, sedimentation equilibrium (Behlke and Ristau, 1999), osmometry (Moon at 
al., 2000), neutron (Velev et al., 1998) and x-ray scattering (Bonnete et al., 1999), 
and self-interaction chromatography (Tessier et al., 2002) have been employed to 
quantify protein solution non-ideality. 

(Nichol et al., 1978) showed the possibility of measuring B2 with frontal elu- 
tion liquid chromatography. Although frontal chromatography (Nichol et al.,1978, 
Wills et al., 1980) allows one to fix the solute concentration in the column directly, 
it requires a large amount of protein (~ 0.5 g) and long experiment times (about 
three hours per column run). In this study we extend their method to pulse size 
exclusion HPLC, where a small amount of protein is injected into and subsequently 
flows down the column. This adaptation drastically reduces the amount of protein 
(< 25 mg) and time needed (about 15 minutes per column run) to measure B2 by 
SEC. We show that our results for B2 obtained with size exclusion chromatogra- 
phy agree well with those from frontal chromatography and from light scattering 
measurements. We also demonstrate that SEC can track the evolution of B2 from 
positive to negative values. 
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Theory 

For the reader's convenience we reproduce the theory of (Nichol et al.,1978). We 
assume a balance of the solute, i.e. protein, chemical potentials (p p and pi) between 
the stationary and mobile phases as the solute is transported through the column. 
The pore volume (i.e. stationary phase) is labelled with the subscript p, and the 
inter-pore volume (i.e. mobile phase) with the subscript i. Equilibrium requires 
p p = p. L . We write these chemical potentials by including the standard part p°, 
the ideal term, and a term accounting for thermodynamic non-ideality through the 
activity coefficient 7: 

p p = p° p + RT ln(C p 7p (C p )) 
m = + RT \n(Ci 7i (Q)) 

where Cj p are the local solute weight concentrations, R is the universal gas constant 
and 7p(C p ), 7 i(Cj) are the thermodynamic activity coefficients of solute molecules 
in the pore and inter-pore volumes respectively. Rearrangement of these equations 
yields: 

1 / f \ ~ ^p 

HKo) = ~^ F - 
h (-) = ln ( ~ HKo) (2) 



where Kq is the partition coefficient of solute molecules between chromatographic 
phases in the limit of infinite dilution. The relation between weight concentration, 
C, and number density, p, is p = Cjf-. Na is Avogadro's number and M w is the 
solute molecular mass. (Nichol et al., 1978) made a virial expansion of the activity 
coefficients 

ln 7 (z) = 2B 2 {N A /M W )C + higher terms (3) 
We note that this consideration assumes no difference in the solute-solute interac- 



tions in the mobile and stationary phases. The local solute distribution coefficient 
is Kb = If Kb is independent of concentration, as is the case for pulse chro- 
matography with B2 = 0, or if the concentration is constant as in frontal elution 



chromatography, then (Nichol et al., 1978, Yau et al., 1979) 

where t r and V r are the solute retention time and volume, to an d Vq are the retention 
time and volume of completely excluded molecules (i.e. the "dead" volume), and 
tx and Vt the retention time and volume of completely included molecules (i.e. the 
"total" volume). Inserting the definition of Ku(Eq. |IJ and Eq. |3]mto Eq. [21 and 
keeping only the first order terms in concentration one obtains a relation between 
Kz>, B2, and Cj valid for frontal chromatography where the concentration Cj is the 
plateau value of the solute concentration in the mobile phase: 

ta G£) =2 *i£ £4(1 - jrri < 5 > 

To adapt this to pulse chromatography we replace the plateau value with the av- 
erage concentration < d > of the mobile phase in the pulse: 

'"(f) = 2B ^ <G>(1 - A ' D) < 6 > 

Since < d > is not directly accessible in a HPLC experiment one must relate 
it to measurable parameters. One determines the mass of solute molecules in the 
pulse, or migration zone, (m zone ) by integrating the concentration as a function of 
time over the zone volume, i.e. the peak(V z ). For our columns, in which there is no 
irreversible binding of protein molecules to the column, all the injected molecules 
are accounted for by integrating the peak. Therefore the total injected mass is the 
same as the total mass in the zone, m-^ = Cinj Mnj = ""izonej but the concentration of 
solute in the migration zone is much lower than the injected concentration because 
the pulse spreads as it is transported through the column. The condition for the 
conservation of mass of solute molecules in the migration zone (subscript z) is 

mi + m p = m in j (7) 
< Ci > Vi+ <C p >V p = CinjV-nj = m inj 

Here Vi and V p are the mobile (inter-pore) and stationary (pore) portions of the 
zone volume V z , with 

V p = (Vt-V )% ,Vi = (V Q )% (8) 
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We measure the solute zone volume V z from the full width At at half-maximum of 
the chromatogram peak using V z = uAt, where v is the average flow rate. After 
substituting the definition of the partition coefficient given in Eq. 0] and definitions 
Eq. [SJinto Eq. [71 one obtains: 



A simple way to understand Eq.|5]is to note that the numerator is the total mass in 
the zone and the denominator is the volume of the zone accessible to the protein. 
Thus, the concentration < Cj > is the ratio of these terms. In this derivation 
we have assumed Eq. |1] holds, which is no longer the case when both B2 ^ 
and the concentration is changing during transport down the column. However, 
as we will show below, the changes in Kjj with concentration are small, which 
may justify our approximation. This relation allows us to extend the method of 
(Nichol et al., 1978), originally developed using frontal elution chromatography, 
to pulse HPLC. Alternatively, one could use the maximum concentration C max of 
eluted solute instead of < Cj > in Eq. H3 As shown in Fig. ^ C max and < Cj > 
are almost equal. Our procedure is then to inject different volumes of samples at 
various concentrations, measure from the retention times as given in Eq. 0]and 
then plot lnKz) as a function of either < d > (1 — Kp) or C max (l — Kd). The 
slope of that plot is then 2 B2 Na/M w . 



Experimental 

Materials 

We obtained lysozyme (6x crystallized hen egg white), from Seikagaku America. 
Our studies, along with others' (Muschol and Rosenberger, 1997), of the purity 
of lysozyme preparations from Sigma and Seikagaku showed the Seikagaku to be 
purer and it was used without further purification. We obtained bovine serum al- 
bumin (BSA), from Sigma, and it was used without further purification. All buffer 
components were obtained from Fisher Scientific. A Millipore Elix system purified 
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water for all the experiments. We prepared potassium phosphate buffers by mixing 
50mM solutions of K2HPO4 and KH2PO4, at various NaCl concentrations to adjust 
the ionic strength, to reach the desired pH = 6.2 as measured by an Orion SA520 
pH meter. The pH = 4.7 of sodium acetate buffers was adjusted by adding concen- 
trated acetic acid to solutions of sodium acetate and NaCl. Additionally all buffers 
were passed through 0.45 /^m nylon filters, also obtained from Millipore, prior to 
use. Protein concentrations were measured using a Varian instruments Cary 50Bio 
spectrophotometer at a wavelength of 278 nm. The extinction coefficient used for 
lysozyme was e278nm = 2.64 ml (mgcm) -1 , and €278nm = -667 ml (mg cm) -1 for 
BSA. 

Chromatography 

An 1100 series liquid chromatography (HPLC) system from Agilent Technologies 
(Wilmington, DE) was used for all chromatographic measurements. Protein reten- 
tion times were determined using an Agilent differential refractive index detector 
(RID) and an Agilent diode-array-detector (DAD) by absorbance at 278nm. A 
TSK-G2000SW (30cm x 0.75cm I.D.) column from TosoBiosep and a YMC-Diol- 
200AMP (30cm x 0.60cm I.D.) column from YMC were used in the chromato- 
graphic measurements. We used a flow rate of 1 ml/min for all measurements. 
These columns contain a packing of porous silica beads whose surfaces have been 
hydrophilicly modified. From the manufacturer's specifications the diameter of a 
single bead is about 5/Lim for both columns. The average pore diameter is 125 
A for the TSK-G2000SW, and 200 A for the YMC-Diol-200AMP. We determined 
the SEC calibration curve for these columns by using poly-ethylene-glycol (PEG) 
samples with molecular weights 200 < M w < 10 5 g/mol, obtained from Sigma 
and Fluka. For every run the eluent was the same as the sample buffer. The 
random run-to-run difference in retention times for our system was < 0.1%. Any 
dependence of the dimensionless distribution coefficient Kd for protein molecules 
between the stationary and mobile phases on the average flow rate v would indicate 
non-equilibrium effects. We found Kn to be totally independent of flow rate for 
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the experimentally accessible values: O.lml/min < v < 1.3ml/min. 

Methods 

For each solvent condition we performed a series of HPLC experiments varying 
solute (protein) injected concentration Cj n j and using two injection volumes, Vjnj = 
20 and 100 fil. We identify the protein retention time t r as the time of the maximum 
in the RID signal (Fig. 1), where the injection time is t = 0. We plot t r as a function 
of Ci n j , and find that t r depends on V5 n j as shown in Fig. [21 In order to apply our 
modification of (Nichol et al., 1978) 's method to HPLC, we recalculate the average 
solute concentration in the peak zone, < C, >, as described in Eq. |UJ and find 
that this reassuringly collapses the multiple t r vs. Ci n j curves from Fig. |2] to a 
single curve as shown in the insert of Fig. |31 The slope of this collapsed curve is 
proportional to the second virial coefficient according to Eq. \5\ 

In order to calculate Kd according to Eq.^Jwe must measure the total (tx) and 
dead (to) times. We have measured the total time for each run using the solvent 
peak (these are maximums of the second peaks (tx) in Fig. In order to measure 
the dead time, we used PEG with a molecular weight of 10 5 g/mol, which is totally 
excluded from the TSK and the YMC columns. We have measured the dead times 
for all solvent conditions and injection volumes. It is important to measure tx and 
to separately for all injection volumes to avoid any instrumental errors associated 
with precisely identifying the injection time. 

We have performed light scattering measurements to determine B2 indepen- 
dently for a condition where results were not found in the literature. We employed 
the same method as in (George and Wilson, 1994) to measure the Rayleigh ra- 
tio of protein solutions using toluene as a standard at a scattering angle of 90 
degrees. In Eq. ^ B2 has the units of volume, but virial coefficients are often re- 
ported in units of ml mol/g 2 , which is denoted by A2 (George and Wilson, 1994). 
Then B2 = ^M^/iV^, where A^4 is Avogadro's number. Our results are shown in 
Table □ 
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Results 



We have measured the dependance of the retention factor Kq on C n j and Vi n j 
for lysozyme and BSA in the above mentioned buffers and columns. These buffer 
conditions were chosen to investigate the cross-over from positive to negative B2 
values and to compare with data available in the literature. 

Fig. ^ shows the RID signal measuring the concentration of the eluted protein 
versus time for representative lysozyme chromatograms with Vi n j = 20 /A. One 
can see the retention time increase with increasing protein concentration, while tx 
remains constant. 

In the size exclusion mode, the direction of the shift in the retention time 
with concentration depends on the sign of B2. For conditions where B2 > 0, t r 
increases with increasing protein concentration and where B2 < 0, t r decreases with 
increasing concentration. If B2 = 0, t r is independent of concentration. Previous 
studies (Velev et al., 1998, Muschol and Rosenberger, 1995, Gripon et al., 1997, 
Kulkarni, 1999) have shown that B2 for protein solutions depends on the ionic 
strength of the solution. 

Fig. |21 shows the dependence of lysozyme retention times on the injected con- 
centration Cjnj. The two sets of data correspond to different injection volumes 
(Vinj): 20 /A and 100 fA. Following the procedure introduced above for determining 
the average solute concentration in the mobile phase of the migration zone < C > , 
we plot the dimensionless retention parameter, Kjj, versus < Cj > in the insert 
of Fig. |j3 This procedure collapses the data from Fig. [21 onto a single curve from 
which Vi n j has been removed as an independent parameter. At the smallest con- 
centrations in the insert of Fig. |31 some non-linear dependence of Kd on < C% > 
can be observed. We attribute this behavior to errors introduced at the smallest 
signal to noise ratios. We have not included these points in our fits. 

In order to extract B2 from chromatographic data, one calculates < Cj > by 
Eq. and then plots lni^ versus < C > (1 — Kd) . Following (Nichol et al., 
1978), the slope of a linear fit to such a plot is then 2B2 Na/M w , as in Fig. |31 

The protein concentration range typically used to measure B2 by light scat- 
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tering is approximately < Cj < 30 mg/ml, (Velev et al., 1998, Muschol and 
Rosenberger, 1995). In our SEC measurements the protein concentrations < Cj > 
eluting from the column, correspond to precisely the same range, although the 
injected concentrations are much higher as show in Fig. [21 Even with these high 
concentrations, we never saturated our column. Such high injected concentrations 
may not be accessible for other protein systems, and may in fact be avoided by 
employing larger injection volumes, as shown by the Vi n j = 100/d data in Fig. [21 

In Fig.^Jwe compare our B2 results for BSA from pulse SEC and those obtained 
by (Nichol et al., 1978) using frontal chromatography. Our results show the same 
slope for InKjj as a function of < Cj > (1—Kjj) as those obtained by (Shearwin and 
Winzor, 1990), which means the B2 values are the same. The solution conditions 
for the two data sets differ, but other studies (George and Wilson, 1994, Moon et 
al., 2000) have shown that B2 for BSA is insensitive to many changes in solution 
conditions until crystallizing conditions are approached. Therefore we expect to 
measure a similar value of B2. We measured different values of Ke> than those in 
(Shearwin and Winzor, 1990) simply because we used a different column. 

In order to further validate the extraction of B2 from size exclusion chromatog- 
raphy (SEC), we compare our results to those obtained by light scattering in Fig. [21 
and in Tabled Fig. shows the dependence of the second osmotic virial coefficient 
on solution ionic strength (added NaCl concentration) for lysozyme. Our data 
agrees quantitatively with those previously obtained over a wide range of ionic 
strengths. Table ^ compares A2 values obtained in a different buffer, Potassium 
Phosphate 50mM pH 6.2. For this buffer our SEC measurements of A2 also agree 
with those from light scattering in their sign. The differences in magnitude can be 
attributed to systematic errors associated with light scattering and SEC measure- 
ments of A2, not statistical variation. Note that previously published results for 
A2 from various groups, as shown in Fig. [21 differ by as much or more than the 
values shown in Table ^ These results illustrate the ability of SEC to track the 
evolution of protein interactions from net repulsive A2 > to attractive A2 < 0. 
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Conclusion 



We have adapted the idea of (Nichol et al., 1978) and present measurements of 
protein second virial coefficients using the standard practice of size exclusion liq- 
uid chromatography, thereby reducing the cost in time and material of performing 
B2 measurements for protein solutions. After calculating the protein concentra- 
tion in the solute zone, our results agree with those previously obtained using an 
independent method, light scattering, in a number of other studies. 
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A 2 (10- 4 ml mol/g 2 ) 


NaCl (mM) 


From SEC 


From LS 





2.4 


1.8 


50 


1.6 




150 


-1.0 


-1.4 



Table 1: Comparison of size exclusion chromatography (SEC) and light scattering (LS) 
measurements of the second virial coefficients (10~ 4 ml mol/g 2 ) for lysozyme in Potas- 
sium Phosphate Buffer 50 mM, pH=6.2, at various added NaCl concentrations. 
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Figure 1: Lysozyme chromatograms for Vj n j = 20/d and different injected concentrations 
(Ci n j) as indicated next to each curve. The average concentrations < Cj > used in 
the analysis in Fig. El are dash - 0.74 mg/ml, points - 1.72 mg/ml, dash and points - 
2.20 mg/ml, long dash - 2.68 mg/ml, long dash and points - 3.08 mg/ml. The vertical 
line marks the retention time for the most dilute sample (not shown). The retention time 
t r is the time corresponding to the peak of the concentration profile (C max ) and increases 
with increasing concentration. The retention time of completely included molecules (the 
"total" volume) is marked as tr and is caused by the buffer. The retention time of 
completed excluded molecules (the "dead" volume) was to = 6.07 min (not shown in 
figure). Note that C max and < Cj > are similar. The buffer is Sodium Acetate, 50mM, 
pH 4.7. 
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Figure 2: Lysozyme retention times versus injected concentrations for two injection vol- 
umes. Set a : Vinj = 20/xl and set b : l^nj = 100/xl. Buffer : Sodium Acetate 50 mM pH 
4.7. 
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0.02 0.04 0.06 0.08 0.1 
Q [1-K D ]10~ 3 [mol/1] 

Figure 3: lnKp vs. < Ci > (1 — Xi))(mg/ml) for lysozyme as in Eq. El The insert 
is a plot of Kr, vs. < Ci >(mg/ml), where multiple curves from Fig. |2 with different 
injected volumes collapse after recalculating the solute concentration in the mobile phase 
of the migration zone as in Eq. Buffer: Sodium Acetate 50 mM pH 4.7, black dots: 
Vi n j = 100/xl, open circles: = 20/il. 
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Figure 4: Comparison of size exclusion chromatography (SEC) and frontal chromatog- 
raphy measurements for BSA. Open diamonds: BSA in Sodium Acetate 20mM, NaCl 
0.18M, pH=4.6, A 2 = 1.9 x 10" 4 ml mol/g 2 (Shearwin and Winzor, 1990). Black points 
are for BSA in Potassium Phosphate 50mM, pH=6.2, A 2 = 2.0 x 10 -4 ml mol/g 2 . In- 
jected concentrations are 1.14, 4.85, 10.05, 15.0, 20.7, 25.27, 30.44, 40.72, 50.99 mg/ml. 
Injection volumes are for squares: 2fA ; stars: 10/xl; triangles: 40/xl; diamonds: 100/xl. 
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Figure 5: Comparison of size exclusion chromatography (SEC) and light scattering mea- 
surements for lysozyme. A 2 (10 -4 ml mol/g 2 ) versus NaCl concentration. Buffer: Sodium 
Acetate 50 mM pH 4.7. The SEC measurements are denoted by open pentagons. The 
data for 0, 100, 200, and 300 mM NaCl were taken on a TSK column, and the data for 
400 mM NaCl was with a YMC column. The remaining data comes from published light 
scattering data. Black diamonds: (Gripon et al., 1997), black triangles: (Velev et al., 
1998), black stars: (Kulkarni, 1999), black rectangles: (Muschol et al., 1995) 
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